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METHOD AND APPARATUS FOR ARBITRATING TRANSACTIONS 
BETWEEN DOMAINS IN A COMPUTER SYSTEM 

BACKGROUND OF THE INVENTION 
5 1. FIELD OF THE INVENTION 

This invention relates generally to computer systems, and, more particularly, to a 
H method and apparatus for arbitrating transactions between domains in a computer system. 
O 2. DESCRIPTION OF THE RELATED ART 

J Network computing has increased dramatically over the past several years due in part 

Lgj to the emergence of the Internet. Some trends in the industry include a significant growth in 

U Applications Service Providers (ASPs) that provide applications to businesses over networks 

O that use the Internet to distribute product data to customers, take orders, and enhance 

M, 

3 communications between employees, for example. 

La 

Typically, businesses rely on network computing to maintain a competitive advantage 
15 over other businesses. As such, developers typically take several factors into consideration to 

meet the customer's expectation when designing processor-based systems for use in network 
environments. Such factors, for example, may include functionality, reliability, scalability 
and the performance of these systems. 

One example of a processor-based system used in a network environment is a mid- 
20 range server system. A single mid-range server system may be configured for a plurality of 
operating domains, where a domain may act as a separate machine by running its own 
instance of an operating system to perform one or more of the configured tasks. 

The benefits of providing substantially independently operating system domains 
within an integrated system become readily apparent as customers are able to perform a 



2 



2070.005300/DIR 
P6769 



variety of tasks that would otherwise be reserved for several different machines. However, 
managing a plurality of system domains within a system sometimes proves to be a 
challenging task, as designers of such systems have a finite number of resources to build a 
functioning system that also meets customer demands. For example, managing one or more 

5 system domains in a system may consume valuable hardware resources that could otherwise 

be used to deliver additional features to the customers. 

To enhance the availability of the processor-based system, redundant hardware may 
be provided. If one component fails, its redundant counterpart may be configured to take 

1 over its functions in the system. If the system were configured to use only one of two 
l5 redundant components at any time until a failure occurs, the processing potential of the 
unused component would be unrealized. 

If The present invention is directed to overcoming, or at least reducing the effects of, 

. ~ one or more of the problems set forth above. 

*"* SUMMARY OF THE INVENTION 

15 One aspect of the present invention is seen in a method for communicating 

transactions. The method includes providing an interconnect having a plurality of ports for 
communicating transactions between a plurality of domains in a computing system. Each 
port is associated with a subset of the domains. The interconnect includes a first signal path 
for transmitting a first portion of the transaction and a second signal path for transmitting a 

20 second portion of the transaction. A transaction issued from a port associated with more than 

one of the domains is identified. An error in one of the first and second portions of the 
transaction is identified. The transaction is canceled responsive to identifying the error. 

Another aspect of the present invention is seen in a computing system for 
communicating transactions that includes first and second devices. The first device is 

25 adapted to receive a first portion of a transaction. The second device is adapted to receive a 
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second portion of the transaction in lockstep with respect to the first device. The first and 
second devices are further configured to identify an error in one of the first and second 
portions of the transaction, respectively, and the one of the first and second devices that 
identifies the error is further configured to send a notification signal to the device associated 
5 with the other of the first and second portions of the transaction responsive to identifying the 

error. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention may be understood by reference to the following description taken in 
1 conjunction with the accompanying drawings, in which like reference numerals identify like 
10 elements, and in which: 

Figure 1 shows a stylized block diagram of a computing system in accordance with 
C one embodiment of the present invention; 

Figure 2 illustrates a block diagram of an exemplary system domain configuration that 
may be employed in the system of Figure 1, in accordance with one embodiment of the 
15 present invention; 

Figure 3A to Figure 3E illustrate various aspects of signal paths in the computing 
system of Figures 1 and 2 in accordance with one embodiment of the present invention; and 

Figure 4 illustrates a simplified block diagram of a method for arbitrating transactions 
between domains in a computer system in accordance with one embodiment of the present 
20 invention. 

While the invention is susceptible to various modifications and alternative forms, 
specific embodiments thereof have been shown by way of example in the drawings and are 
herein described in detail. It should be understood, however, that the description herein of 
specific embodiments is not intended to limit the invention to the particular forms disclosed, 
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but, on the contrary, the intention is to cover all modifications, equivalents, and alternatives 
falling within the spirit and scope of the invention as defined by the appended claims. 
DETAILED DESCRIP TI ON OF SPECIFIC EMBODIMENTS 
Illustrative embodiments of the invention are described below. In the interest of 
5 clarity, not all features of an actual implementation are described in this specification. It will 

of course be appreciated that in the development of any such actual embodiment, numerous 
Lx implementation-specific decisions must be made to achieve the developers' specific goals, 
q such as compliance with system-related and business-related constraints, which will vary 
Jp from one implementation to another. Moreover, it will be appreciated that such a 
10^ development effort might be complex and time-consuming, but would nevertheless be a 
routine undertaking for those of ordinary skill in the art having the benefit of this disclosure. 
Z Referring now to Figure 1, a block diagram of a computing system 100 in accordance 

f=l with one embodiment of the present invention is illustrated. The computing system 100 
includes a plurality of system control boards 105(1), 105(2) and a plurality of system boards 
15 110(l)-110(n) coupled to a centerplane 115. For illustrative purposes, lines 106(1-2) and 

lines 108(1)-108(«) are utilized to show that the system control boards 105(1), 105(2) and 
system boards 110(1)-110(«), respectively, are coupled to the centerplane 115. However, as 
will be appreciated by those skilled in the art having the benefit of this disclosure, the system 
control boards 105(1), 105(2) and system boards 110(1)-110(«) may be coupled to the 
20 centerplane 1 1 5 in any of a variety of ways. These ways include, but are not limited to, edge 

connectors, cables, or other available interfaces. The system boards 110(1)-110(«) may 
interface with the centerplane 115 using the same technique as the system control boards 
105(1), 105(2), but this is not necessary to the practice of the invention. The system control 
boards 105(1), 105(2) and the system board sets 1 10(1)-1 10(n) may be connected to the 
25 centerplane 115 using different types of interfaces. 
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In the illustrated embodiment, one of the control boards 105(1), 105(2) manages the 
overall operation of the computing system 100, and the other provides redundancy and 
automatic failover in the event that the first board fails. Although invention is not so limited, 
in the illustrated embodiment, the first system control board 105(1) serves as a "main" system 
control board, while the second system control board 105(2) serves as an alternate, hot-swap, 
replaceable system control board. During any given moment, generally one of the two 
system control boards 105(1), 105(2) actively controls the overall operations of the 
computing system 100. Accordingly, the term "active system control board," as utilized 
hereinafter, may refer to either one of the system control boards 105(1), 105(2), depending on 
which board is managing the operations of the computing system 100 at that moment. Note, 
however, that some embodiments may employ only a single system control board 1 15(1). 

The centerplane 115 serves as a communications conduit for the system boards 
110(1)-1 10(h), half of which may be connected on one side of the centerplane 115 and the 
other half on the opposite side of the centerplane 115. In the illustrated embodiment, the 
centerplane 115 includes an n x n crossbar that allows up to n system boards 11 0(1)- 110(h) 
and the system control boards 105(1), 105(2) to communicate with one another, if desired. In 
one particular embodiment, n = 18. Thus, the centerplane 115 may allow the two system 
control boards 105(1), 105(2) to communicate with each other or with other system boards 
11 0(1)- 110(h), as well as allow the system boards 1 10(1)-1 10(h) to communicate with each 
other. 

Turning now to Figure 2, each system board 11 0(1)- 11 0(h), in the illustrated 
embodiment, comprise one or more boards, including a constituent system board 200, an 
input/output (I/O) board 205, and an expansion board 210. Thus, each of the system boards 
110(1)-1 10(h) is a board "set." Note that alternative embodiments might not employ board 
sets or, alternatively, employ board sets with differing numbers of constituent boards. As 
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those in the art having the benefit of this disclosure will appreciate, the functionality of one or 
more of the constituent system board 200, I/O board 205, and expansion board 210 might be 
incorporated onto a single board. Conversely, that functionality may be spread across any 
number of boards that can be interconnected in the manner shown. Thus, the number of 
5 constituent boards in the board sets 11 0(1)- 110(h) may range from 1 on up to any number. 

Again, as those in the art having the benefit of this disclosure will appreciate, there is a 
practical upper bound to this number. The upper bound generally arises from the fact that 
A greater numbers of boards tend to degrade performance, complicate serviceability, and 
2 compromise reliability. However, this is an implementation specific concern. From the 
ti standpoint of the invention, the number of boards in the board sets 1 10(1)-1 lO(rc) is 

immaterial. 

The constituent system board 200 may include processors (not shown), as well as 
memories (also not shown), for executing applications, including portions of an operating 
system. The processors may be, for example, UltraSPARC™ 64-bit processors available 

15 from Sun Microsystems, any of the Itanium™, Pentium™, or Alpha™-class processor from 

Intel Corporation, or the Athlon™ or Duron™-class processors from Advanced Micro 
Devices, Inc. In one particular implementation, the computing system 100 employs an 
UNIX™-based operating system (e.g., a Solaris™ OS) commercially available from the 
assignee of this application, Sun Microsystems, Inc. However, the invention may be 

20 employed with virtually any operating system. The I/O board 205 may manage I/O cards, 

such as peripheral component interface cards (not shown) and optical cards (also not shown), 
that are installed in the computing system 100. 

The expansion board 210, in the illustrated embodiment, generally acts as a 
multiplexer (e.g., 2:1 multiplexer) to allow both the system and I/O boards 200, 205 to 
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interface with the centerplane 115, which, in some instances, may have only one slot for 
interfacing with both boards 200, 205. 

In the illustrated embodiment, the computing system 100 is dynamically subdivided 
into a plurality of system domains, where each domain may have a separate boot disk (to 
execute a specific instance of the operating system, for example), separate disk storage, 
network interfaces, and/or I/O interfaces. System domains are a mix of I/O assemblies and 
CPU/memory resources. System domains are isolated from failures (e.g., operating system 
errors, software errors, CPU/memory failures, I/O assembly failures) in other system 
domains. Each domain may operate as a separate machine that performs a variety of user- 
configured services. For example, one or more domains may be designated as an application 
server, a web server, database server, and the like. Each domain may run its own operating 
system and may be reconfigured without interrupting the operation of other domains. The 
subdivision is handled dynamically in the illustrated embodiment, as is discussed further 
below, although the invention is not so limited. 

Figure 2 illustrates an exemplary arrangement wherein at least two domains are 
defined in the computing system 100 in the illustrated embodiment. The first domain, 
identified by vertical cross-sectional lines, includes the system board 110(n/2+2), the 
constituent system board 200 of the system board 1 10(1), and the I/O board 205 of the system 
board 110(2). The second domain in the illustrated embodiment includes the system boards 
1 10(3), 1 10(n/2+l), and 1 10(n/2+3), as well as the I/O board 205 of the system board 1 10(1) 
and the constituent system board 200 of the system board 1 10(2). 

A domain may be formed of an entire system board 110(l)-110(rc), one or more 
boards (e.g., constituent system board 200, I/O board 205) from selected system boards 
110(l)-110(rc), or a combination thereof. Although not necessary, it may be possible to 
define each system board 110(1)-110(«) as a separate domain. For example, if each system 
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board 1 10(1)4 \0(n) were its own domain, the computing system 100 may conceivably have 
up to "n" (i.e., the number of system boards) different domains. When two boards (e.g., 
constituent system board 200, I/O board 205) from the same system board 110(l)-110(n) are 
in different domains, such a configuration is referred to as a "split expander" configuration. 

5 The expansion board 2 1 0 of the system boards 1 1 0( 1 )- 1 1 0(n) keep the transactions separate 

for each domain. Physical proximity is not required for boards in a system domain. The split 
expander configuration is described in greater detail in U.S. Patent Application Serial No. 

1 09/963,082, (Atty Docket No. 2070.004700, Client Docket No. P6763), entitled "METHOD 

1 AND APPARATUS FOR PROVIDING ERROR ISOLATION IN A MULTI-DOMAIN 
l5 COMPUTER SYSTEM" in the names of Donald Kane, Steven E. Weiss, Eric Graf, and 

s Andrew Phelps. 

h= Note, however, that the invention is not limited to computing systems in which the 

formation of system domains is performed dynamically. For example, some embodiments 
may restrict the allocation of system resources to various system domains only upon reset of 

15 the entire computing system. Furthermore, resources may be allocated to system domains 

either manually or automatically, i.e., under the automated control of one or more of the 
system control boards 105(1), 105(2). The manner in which system domains are created, 
deleted and the manner in which resources are allocated or re-allocated to existing system 
domains is not material to the practice of the present invention. 

20 Returning to Figure 1, communications between system boards 110(1)-110(«) are 

performed through the centerplane 115. As shown in Figure 3 A, the centerplane 115, in the 
illustrated embodiment, includes a data interconnect 300, an address interconnect 305, and a 
response interconnect 310 for transmitting respective data, address, and response signals 
among the system domains defined from the system boards 1 10(1)-1 10(«). More 

25 particularly, the data interconnect 300 and the lines 315 comprise a data signal path; the 
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address interconnect 305 and the lines 320 comprise an address signal path; and, the response 
interconnect 310 and the lines 325 comprise a response signal path. Thus, the centerplane 
115 includes three interconnects 300, 305, 310 that provide a separate data path, address path, 
and response signal path to allow intra-domain and inter-domain communications. 

In Figure 3 A there are actually multiple point-to-point connections between the 
system boards 110(l)-110(n) and the interconnects 300, 305, 310. The signal paths 
illustrated in Figure 3A are collections of many individual signal paths. Each individual 
signal path is many bits wide. For instance, in one particular embodiment, the data path is 
288-bits wide, the address path is 148-bits wide, and the response path is 66-bits wide. Thus, 
each line 315, 320, 325 is many bits wide. Each individual signal path comprises two halves. 

In its "normal" mode of operation, the computing system 100 divides the information 
for any given transaction into two messages and transmits the two messages in parallel on the 
two halves of the respective individual signal path. For example, the payload of a data 
transaction may be split into two portions, with one payload portion being sent on each half 
of the signal path. The control information for each half of the transaction is the same for 
both halves of the transaction. In the illustrated embodiment, the control information is 
protected with parity checking and the payload information is protected with error checking 
and correcting (ECC) code to allow for single bit error corrections and double bit error 
detections. 

In accordance with the invention, the computing system 100 enters a "degraded 
mode" upon a failure of one half of the signal path. In the degraded mode of the illustrated 
embodiment, all of the information is transmitted in a single message on a single half of the 
path. While this takes twice as many cycles as in normal mode, the computing system 100, 
including that part in which the failure occurs, is still available until the fault can be repaired. 
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As is shown in Figure 3 A, the system control boards 105(1) and 105(2) are not on the 
data, address, or response signal paths. The system control boards 105(1) and 105(2) instead 
connect to the interconnects 300, 305, 310 on console connections 328(1) and 328(2), 
respectively. The system control boards 105(1) and 105(2) configure the interconnects 300, 
305, 310 over the console connections 328(1), 328(2), in a manner more fully described 
below. 

Turning now to Figure 3B, each interconnect 300, 305, 310, includes two halves 
330(1), 330(2). Each half 330(1), 330(2) is implemented by one or more ASICs (not shown) 
and has m receiving ports 335(l)-335(m) and m transmitting ports 340(1 )-340(m), where m is 
the number of expansion boards 210 in the computing system 100. Thus, if there are 18 
expansion boards, each half 330 would include 18 receiving ports 335 and 18 transmitting 
ports 340. Each of the ports 335(l)-335(m), 340(l)-340(m) is half as many bits wide as the 
individual signal path, as each half 330(1), 320(2) handles information on half the individual 
signal path. 

As seen in Figure 3C, each half 330(1), 330(2) includes an arbiter 331(1), 331(2), and 
a crossbar switch 332(1), 332(2), respectively. The arbiters 331(1), 331(2) operate in a 
lockstep manner to retain synchronicity so that the message halves will be delivered to their 
destinations in the proper order. Consider, for the sake of illustration, an individual data 
signal path 345, shown in Figure 3D, between a switch 380, on the expansion board 210 of 
the system board 1 10(1) and the switch 380 on the expansion board 210 of the system board 
110(rc). Each expansion board 210, in this particular embodiment, includes a switch 380 
through which transactions are sent and received in a manner discussed more fully below. 
Address and response signal paths are structured similarly except for obvious changes, e.g., 
the address interconnect 305 is used instead of the data interconnect 300. 
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As shown in Figure 3E, each switch 380 includes an address/response switch 350 and 
a data switch 355 that handles information received over the respective response, data, and 
address signal paths. Thus, the switch 380 handles all signal path transactions for its 
respective system board 1 10(1)-1 10(h). Note that the number of these switches in the switch 

5 380 is not material to the invention. For example, in some embodiments any number from 1 

to 3 may be employed. The response/address switch 350 and the data switch 355 are 
implemented in ASICs (not shown) that are also configurable over the console connections 

g 328(1), 328(2). 

1 Returning to Figure 3D, the individual data signal path 345 comprises a first half 

a 

1£> 360(1) and a second half 360(2). When the computing system 100 operates in a normal 

3 mode, the information for any given transaction is separated into two portions, each packaged 

^ with a common header including control information in a respective message. Each message 

o 

K is then transmitted over a respective half 360(1), 360(2) of the individual data signal path 345 

(including a respective half 330(1 ), 330(2) of the data interconnect 300) from one switch 3 80 

1 5 to the other switch 380. 

The number and type of signal paths, e.g., the data individual data signal path 345 in 
Figure 3D, is implementation specific. In the illustrated embodiment, there are three separate 
kinds of signal paths, i.e., data, address and response. There is one signal path of each kind 
between each expansion board 210 of each system board 1 10(1). Thus, there are (3 x n x ri) 

20 signal paths in the illustrated embodiment. In one implementation, n = 1, yielding a total of 

972 signal paths. However, the number may vary depending on the number of kinds of signal 
paths and the number of expansion boards 210. 

Note that, in the illustrated embodiment, the computing system 100 has a symmetrical 
physical arrangement. That is, half of the system boards 1 10(1)-1 10(«) are physically located 

25 on one side of the centerplane 115, and the other half of the system boards 1 10(1)-110(«) are 
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physically located on the opposite side of the centerplane 115. In one particular 
implementation, n = 18, so that system boards 11 0(1)- 110(9) are on one side of the 
centerplane 115 and system boards 11 0(10)- 110(18) are on the other. Additionally, in the 
illustrated embodiment, the first system control board 105(1) and the second system control 

5 board 105(2) are diagonally positioned from each other relative to the centerplane 115. The 

symmetrical positioning of the boards 105(1), 105(2), 1 10(1)-1 10(«) in the illustrated manner 
may take advantage of the interface connections on both sides of the centerplane 115. 
However, the invention is not limited to computing systems employing a symmetrical 
physical arrangement. Thus, in some alternative embodiments, the physical arrangement of 

12 the system control boards 105(1), 105(2), the centerplane 1 15, and the system boards 1 10(1)- 

b 1 lO(w) may be asymmetrical. 

M= Returning to Figure 3C, the switches 380 route the two halves of a transaction to each 

of the arbiters 331(1), 331(2) over the paths 360(1), 360(2), respectively. This routing is 

[M " referred to as bit-slicing the transaction. In a bit-slicing arrangement, any two or more 

15 components that handle portions of the same transaction must act on those portions in the 

same way at the same time. This simultaneous action is termed lockstep operation. For 
transactions that cross domains, it is possible that an error in a transaction in one domain may 
result in the loss of the lockstep operation of the arbiters 331(1), 331(2) with respect to the 
ports 335(l)-335(m), 340(l)-340(m) used by another domain. Accordingly, in cases where 

20 transactions are received on ports 335(l)-335(m) that are shared by more than one domain, an 

additional check is made to prevent a hardware failure in one domain that results in an errant 
transaction from causing a problem in another domain due to loss of lockstep. This 
monitoring of ports 335(1 )-335(m) shared among different domains provides additional 
separation of the domains with respect to fault tolerance. Shared-domain ports 335(1)- 

25 335(m) include ports 335(l)-335(m) that are authorized to communicate transactions to more 
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than one domain. Transactions issuing from ports 335(l)-335(m) associated with a split 
expander are also treated as issuing from shared-domain ports 335(l)-335(m), because the 
constituent system board 200 and I/O board 205 reside in different domains, and it is not 
readily apparent whether an individual transaction at the split expander will cross domains. 

5 Because the transactions are bit-sliced and sent in parallel over the paths 360(1), 

360(2), it is possible that one of the transaction halves may have an error in the control 
information. If the non-errant portion of the transaction was allowed to post and the errant 
portion was inhibited, the arbiters 331(1), 331(2) would no longer be in lockstep with one 
another. The loss of lockstep could affect other domains in the computing system 100. 

lift Consider a system configuration having three domains A, B, and C. An error in a 

transaction originating from a split expander associated with domains A and B may result in 
the loss of lockstep for all ports 335(l)-335(m) associated with domains A and B. Another 

J* split expander may be associated with domains B and C. The loss of lockstep for the ports 

335(1 )-335(m) associated with domains A and B would eventually spread to domain C due to 

15 the second split expander. Thus, all ports 335(l)-335(m) associated with domain C could 

then lose lockstep. 

To detect problems with transactions that could affect multiple domains, the arbiters 
331(1), 331(2) impose a two-cycle delay for all transactions associated with shared-domain 
ports 335(l)-335(m) so that they may perform an error check. The arbiters 331(1), 331(2) 

20 exchange colleague notification signals to communicate a failed error check. In the 

illustrated embodiment, the arbiters 331(1), 331(2) perform a parity check on the control 
information associated with each half of the transaction. Parity checks allow for detection of 
errors, but not correction. The payload portion of the transaction is protected with ECC code 
so errors in the payload may be detected and possibly corrected downstream without 

25 compromising other domains. However, errant control information does have the potential to 
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affect other domains, so any transactions with errors in the control information are not posted 
to the crossbar switches 332(1), 332(2). 

The delay allows the arbiters 331(1), 331(2) to check transactions from shared- 
domain ports 335(l)-335(m) for parity before the transactions are acted upon. The arbiters 

5 331(1), 331(2), drop the transaction if either half is found to be in error. If either arbiter 

331(1), 331(2) detects a parity error in the control information, it sends a colleague 
notification signal to the other arbiter 331(1), 331(2). If both arbiters 331(1), 331(2) detect a 

;i parity error, they both send colleague notification signals. 

jf Upon receipt of a colleague notification signal from another arbiter 331(1), 331(2), 

ll both arbiters 331(1), 331(2) drop the current transaction and block the port that issued the 

errant transactions from issuing any further transactions. Each port on the interconnect 300, 
305, 310 has an associated mask that specifies which of the other ports it may access. The 
arbiters 331(1), 331(2) may disable further transactions from the port associated with the 
errant transaction by setting the mask for all other ports such that the errant port is blocked. 

15 In the illustrated embodiment, the colleague notification signal is a serial bit-stream 

consisting of two start bits, both ones, a five bit binary port number identifier, and a three bit 
check-sum. The checksum is the binary sum of the start bits and the port number. The 
checksum does not exceed six. Each arbiter 331(1), 331(2) sends two copies of this signal, in 
phase, to its colleague. Since more than one port in an arbiter 331(1), 331(2) can request 

20 access to colleague notification at a time, the notification circuits may implement an 

arbitration function that selectively allows one port at a time to originate a serial notify. 

The receiving arbiter 331(1), 331(2) will immediately pause when the first bit of a 
colleague notification signal is seen to be a one. Thereafter, the pause is held by the 
receiving circuits until the serial notify can be decoded. When the port of interest is decoded 

25 and the checksum found valid, the pause to all domains is released, and the affected domain 
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is directed to perform a stop command. The affected port also flushes the offending 
transaction by not allowing it to be pushed into a receive buffer, and/or by not allowing it to 
request arbitration service. 

Upon receipt of a colleague notification message, the arbiters 331(1), 331(2) check 
5 each of the two serial bit-streams for a good notify, and then compare the two notifies. A 

good notify has the following characteristics: 

• Two start bits in a row, both ones; 

;;4 • A legal 5-bit port number; legal = 0x00000 through 0x10001 ; and 

• A 3 -bit checksum equal to the checksum of the above: 

|i Following identification of a valid notification event, the arbiter 331(1), 331(2) that 

identified the error sends an error message to the system control board 105(1), 105(2) to 
f allow for further error processing. The system control board 105(1), 105(2) may also 

dynamically reconfigure the computing system 100 in view of the disabled port. 

Turning now to Figure 4, a simplified block diagram of a method for arbitrating 

15 transactions between domains in a computer system in accordance with one embodiment of 

the present invention is shown. In block 400, an interconnect having a plurality of ports for 
communicating transactions between a plurality of domains in a computing system is 
provided. Each port is associated with a subset of the domains. The interconnect includes a 
first signal path for transmitting a first portion of the transaction and a second signal path for 

20 transmitting a second portion of the transaction. In block 410 a transaction issued from one 

of ports associated with more than one of the domains is identified. In block 420, an error in 
one of the first and second portions of the transaction is identified. In block 430 the 
transaction is canceled responsive to identifying the error. 

Identifying errors in transactions issuing from shared-domain ports 335(l)-335(m) 

25 and preventing such transactions from being acted upon improves the stability of the 
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computing system 100. Such oversight increases the robustness of individual domains, as 
they are less susceptible to faults in other domains. 

The particular embodiments disclosed above are illustrative only, as the invention 
may be modified and practiced in different but equivalent manners apparent to those skilled 
in the art having the benefit of the teachings herein. Furthermore, no limitations are intended 
to the details of construction or design herein shown, other than as described in the claims 
below. It is therefore evident that the particular embodiments disclosed above may be altered 
or modified and all such variations are considered within the scope and spirit of the invention. 
Accordingly, the protection sought herein is as set forth in the claims below. 
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